AITopics | network training

Adaptive Meta-Learning Stochastic Gradient Hamiltonian Monte Carlo Simulation for Bayesian Updating of Structural Dynamic Models

Meng, Xianghao, Beck, James L., Huang, Yong, Li, Hui

arXiv.org Machine Learning | Apr-29-2026

In the last few decades, Markov chain Monte Carlo (MCMC) methods have been widely applied to Bayesian updating of structural dynamic models in the field of structural health monitoring. Recently, several MCMC algorithms have been developed that incorporate neural networks to enhance their performance for specific Bayesian model updating problems. However, a common challenge with these approaches lies in the fact that the embedded neural networks often necessitate retraining when faced with new tasks, a process that is time-consuming and significantly undermines the competitiveness of these methods. This paper introduces a newly developed adaptive meta-learning stochastic gradient Hamiltonian Monte Carlo (AM-SGHMC) algorithm. The idea behind AM-SGHMC is to optimize the sampling strategy by training adaptive neural networks, and due to the adaptive design of the network inputs and outputs, the trained sampler can be directly applied to various Bayesian updating problems of the same type of structure without further training, thereby achieving meta-learning. Additionally, practical issues for the feasibility of the AM-SGHMC algorithm for structural dynamic model updating are addressed, and two examples involving Bayesian updating of multi-story building models with different model fidelity are used to demonstrate the effectiveness and generalization ability of the proposed method.
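As a rough illustration of the sampler family named in the abstract, the sketch below runs plain stochastic gradient Hamiltonian Monte Carlo on a one-dimensional standard normal target. It is not the AM-SGHMC algorithm: there is no neural network and no meta-learning, and the gradient is exact rather than a minibatch estimate. The target, the function names, and the values of `step_size` and `friction` are all assumptions chosen for the example.

```python
import math
import random


def grad_neg_log_density(theta):
    """Gradient of U(theta) = theta**2 / 2, i.e. a standard normal target (assumed)."""
    return theta


def sghmc_chain(n_steps, step_size=0.01, friction=0.1, seed=0):
    """Run a plain SGHMC chain and return the visited positions."""
    rng = random.Random(seed)
    theta, velocity = 0.0, 0.0
    # Noise scale that balances the friction term in this discretization.
    noise_std = math.sqrt(2.0 * friction * step_size)
    samples = []
    for _ in range(n_steps):
        # Friction damps the velocity, the gradient pulls toward high
        # density, and the injected noise keeps the chain exploring.
        velocity = (
            (1.0 - friction) * velocity
            - step_size * grad_neg_log_density(theta)
            + rng.gauss(0.0, noise_std)
        )
        theta += velocity
        samples.append(theta)
    return samples


samples = sghmc_chain(50_000)
mean = sum(samples) / len(samples)
variance = sum((s - mean) ** 2 for s in samples) / len(samples)
print(f"mean={mean:.3f}, variance={variance:.3f}")
```

If the chain behaves as intended, the printed mean should be near 0 and the variance near 1. The abstract describes training networks to tune the sampling strategy; here the two tuning values are simply fixed by hand.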

artificial intelligence, bayesian inference, machine learning, (18 more...)

doi: 10.1016/j.cma.2025.117753

2604.2571

Country: North America > United States (0.28)

Genre:

Research Report (1.00)
Workflow (0.67)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (1.00)

09b6e009612875dd0a7291d5f4fd8b49-Supplemental-Conference.pdf

Neural Information Processing Systems | Apr-24-2026, 11:30:24 GMT

artificial intelligence, cross-image feature, machine learning, (15 more...)

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (0.78)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.46)

New Complexity-Theoretic Frontiers of Tractability for Neural Network Training

Neural Information Processing Systems | Feb-16-2026, 14:16:52 GMT

A neural network (cf. Figure 1) can be thought of as a directed acyclic network consisting of

architecture, artificial intelligence, machine learning, (17 more...)

Country:

Europe > Austria > Vienna (0.14)
North America > United States > Maryland > Baltimore (0.04)
North America > United States > California > San Diego County > San Diego (0.04)
(9 more...)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.93)

LCA: Loss Change Allocation for Neural Network Training

Janice Lan, Rosanne Liu, Hattie Zhou, Jason Yosinski

Neural Information Processing Systems | Feb-14-2026, 10:55:56 GMT

This rich view shows which parameters are responsible for decreasing or increasing the loss during training, or which parameters "help" or "hurt" the network's learning, respectively.
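A minimal sketch of the idea, under stated assumptions: the change in loss over each training step is split into one term per parameter, the gradient component times that parameter's movement. The listing does not say how the paper evaluates the gradient along a step; the midpoint rule used here is an assumption, chosen because it makes the per-parameter terms sum exactly to the true loss change for a quadratic loss. The toy loss, `CURVATURES`, and all function names are hypothetical.

```python
CURVATURES = [1.0, 10.0]  # hypothetical per-parameter curvatures of a toy loss


def loss(theta):
    """Toy quadratic loss: 0.5 * sum(c_i * theta_i**2)."""
    return 0.5 * sum(c * t * t for c, t in zip(CURVATURES, theta))


def grad(theta):
    """Gradient of the toy loss with respect to each parameter."""
    return [c * t for c, t in zip(CURVATURES, theta)]


def allocate_loss_change(theta, learning_rate, n_steps):
    """Run gradient descent and accumulate each parameter's share of the loss change."""
    allocations = [0.0] * len(theta)
    for _ in range(n_steps):
        step = [-learning_rate * g for g in grad(theta)]
        new_theta = [t + s for t, s in zip(theta, step)]
        # Midpoint gradient times movement; exact for a quadratic loss.
        midpoint = [0.5 * (t + n) for t, n in zip(theta, new_theta)]
        for i, (g, s) in enumerate(zip(grad(midpoint), step)):
            allocations[i] += g * s
        theta = new_theta
    return allocations, theta


start = [1.0, 1.0]
allocations, end = allocate_loss_change(start, learning_rate=0.25, n_steps=5)
total_change = loss(end) - loss(start)
print("per-parameter allocation:", allocations)
print("total loss change:", total_change)
```

With this learning rate the first parameter shrinks toward zero and receives a negative allocation (it "helps"), while the second overshoots on every step and receives a positive allocation (it "hurts").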

artificial intelligence, lca, machine learning, (18 more...)

Country: North America > Canada > British Columbia > Metro Vancouver Regional District > Vancouver (0.04)

Genre: Research Report (0.31)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.48)

bc943cd038a5531d5433b1431c822c01-Supplemental-Conference.pdf

Neural Information Processing Systems | Feb-11-2026, 15:47:58 GMT

dataset, part-template feature, pointnet, (17 more...)

Country: Asia > China > Shaanxi Province > Xi'an (0.05)

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

b05b57f6add810d3b7490866d74c0053-Paper.pdf

Neural Information Processing Systems | Feb-9-2026, 20:46:46 GMT

arxiv preprint arxiv, gradient, noise, (15 more...)

Country:

North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
North America > Canada (0.04)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.66)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Gradient Descent (0.60)

0e1ebad68af7f0ae4830b7ac92bc3c6f-AuthorFeedback.pdf

Neural Information Processing Systems | Feb-7-2026, 11:38:29 GMT

artificial intelligence, expansion strategy, machine learning, (18 more...)

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.55)

043ab21fc5a1607b381ac3896176dac6-Paper.pdf

Neural Information Processing Systems | Feb-7-2026, 07:28:04 GMT

experiment, precision, relu null, (15 more...)

Country:

Europe > France > Occitanie > Haute-Garonne > Toulouse (0.05)
North America > Canada > Ontario > Toronto (0.04)

Genre: Research Report > New Finding (0.68)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.91)

How degenerate is the parametrization of neural networks with the ReLU activation function?

Neural Information Processing Systems | Dec-24-2025, 23:21:50 GMT

Neural network training is usually accomplished by solving a non-convex optimization problem using stochastic gradient descent. Although one optimizes over the network's parameters, the main loss function generally only depends on the realization of the neural network, i.e., the function it computes. Studying the optimization problem over the space of realizations opens up new ways to understand neural network training. In particular, usual loss functions like mean squared error and categorical cross entropy are convex on spaces of neural network realizations, which themselves are non-convex. Approximation capabilities of neural networks can be used to deal with the latter non-convexity, which allows us to establish that for sufficiently large networks local minima of a regularized optimization problem on the realization space are almost optimal.
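One well-known source of this degeneracy can be shown in a few lines: because ReLU is positively homogeneous, multiplying a hidden layer's weights and biases by a positive factor and dividing the outgoing weights by the same factor changes the parameters but not the realization. The sketch below is only an illustration of that single symmetry, not the paper's analysis; the network shape, the parameter values, and the helper names are assumptions.

```python
def relu(z):
    """Rectified linear unit."""
    return max(0.0, z)


def realize(params, x):
    """Evaluate a one-hidden-layer ReLU network at a scalar input x."""
    hidden = [relu(w * x + b) for w, b in zip(params["w1"], params["b1"])]
    return sum(v * h for v, h in zip(params["w2"], hidden)) + params["b2"]


def rescale_hidden(params, factor):
    """Scale the hidden layer by a positive factor and undo it in the output layer."""
    if factor <= 0:
        raise ValueError("factor must be positive")
    return {
        "w1": [factor * w for w in params["w1"]],
        "b1": [factor * b for b in params["b1"]],
        "w2": [v / factor for v in params["w2"]],
        "b2": params["b2"],
    }


# Hypothetical parameter values for a network with two hidden units.
original = {"w1": [1.0, -2.0], "b1": [0.5, 1.0], "w2": [3.0, -1.0], "b2": 0.25}
rescaled = rescale_hidden(original, 2.0)

inputs = [-2.0, -0.5, 0.0, 0.75, 3.0]
max_gap = max(abs(realize(original, x) - realize(rescaled, x)) for x in inputs)
print("parameters differ:", rescaled != original)
print("largest output gap:", max_gap)
```

The two parameter sets are different points in parameter space, yet they compute the same function on every test input, which is why a loss that depends only on the realization cannot tell them apart.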

neural network, parametrization, relu activation function, (11 more...)

Technology: